Anonymizing at-home fitness: enhancing privacy and motivation with virtual reality and try-on

Introduction This study aimed to address privacy concerns associated with video conferencing tools used in home-based exercise training. To that end, a method that could anonymize participants' appearances and exercise environments during at-home fitness sessions was proposed. Methods This method combines virtual reality for 3-D human-model rendering using key-points tracking with a virtual try-on system enhanced by UV mapping and instance segmentation. To validate the proposed method, we conducted a user study by recruiting participants to assess effectiveness of virtual reality and virtual try-on in terms of privacy protection, self-confidence, and coaching satisfaction. Results Experimental results demonstrated the effectiveness and improved user experience of using virtual reality or virtual try-on in remote fitness, particularly in enhancing privacy protection and self-confidence with statistical significance. However, no significant differences were noted in coaching satisfaction. Discussion These findings confirmed the efficacy of our proposed approach. We believe that the proposed approach can significantly contribute to the future of remote fitness training, offering a more secure and engaging environment for users, thereby potentially increasing adherence to fitness regimens and overall physical wellbeing.


Introduction
Exercise can improve physical appearance.It is also widely recognized as an essential factor for enhancing and maintaining health in the long term.However, the proportion of individuals engaging in insufficient physical activity remains high worldwide, rendering them more susceptible to non-communicable diseases (1,2).Furthermore, people in most countries faced restrictions of physical activity during the COVID-19 pandemic, resulting in significant reduction of overall physical activity levels (3)(4)(5).Although these restrictions have started to ease, physical activity levels of people have not recovered fully (6,7).
In light of these concerns, home-based exercise programs have gained attention as potential alternatives as they can improve accessibility and save time.Home-based exercise is a useful solution for busy individuals who struggle to balance work and physical activity.For instance, home-based exercise has already been proven to be effective for various groups, such as female caregivers with limited time for exercise (8), frail individuals (9), and geriatric patients with cognitive impairments (10).In response to this demand, various online platforms have emerged and shared exercise videos that can be followed at home, benefiting many users.There is evidence that patients who undergo training using recorded exercise videos following COVID-19 hospitalization show encouraging cardiovascular, respiratory, and functional outcomes (11).In addition, home-based exercises can provide clinical benefits to individuals with severe SARS-CoV-2 infection (12).However, participants in these home-based exercise programs show a tendency to experience difficulty when first attempting such exercises.
Video conferencing technology has gained attention in homebased exercise training as it can provide real-time feedback coaching and supervised exercise programs.Exercise training using video conferencing can also provide motivation and offer personalized exercise programs that overcome distance and time constraints.Online exercise programs can provide guidance for independent workouts and monitor continuous exercise, enabling individuals to exercise conveniently from their own residences.Moreover, exercise becomes possible anytime and anywhere with internet connectivity, thus greatly mitigating limitations of time and location.
Home-based exercise training programs often rely on screen sharing for delivering exercise instructions.However, due to privacy concerns, this method may not be considered ideal.This is because exercise videos can contain personal information and sharing videos from home-based training sessions could potentially lead to unintended exposure of such details.In light of the growing popularity of video conferencing tools, recent research has focused on understanding vulnerabilities and risks associated with data privacy in this context (13, 14).Furthermore, other recent studies (15,16) have highlighted privacy concerns among participants in home-based fitness training, which can lead to reluctance in video sharing.
Individuals participating in home-based exercise training often need to expose their clothing and appearance to others, potentially evoking Social Physique Anxiety (SPA).SPA is an emotional response that arises from concerns about others scrutinizing or judging their physical appearance (17).This anxiety is often triggered when individuals feel their physique does not match societal ideals of an "ideal body" (17).Elevated levels of SPA often correlate with lower adherence to physical activity and decreased participation rates in such programs (18).These individual psychological factors, including privacy concerns, social pressure, and self-consciousness, can make exercise participants hesitant to share their videos, with many preferring not to share their videos for these reasons (16).However, interventions aimed at reducing SPA might significantly enhance motivation and promote behaviors that encourage physical activity (17).Additionally, the concept of self-presentational efficacy (SPE) expectancy, defined as a subjective likelihood of successfully of conveying desired impressions to others, has been identified as a key factor influencing both exercise behavior and SPA (19).Elevating SPE expectancy could potentially be an effective strategy to encourage engagement in physical activity.It directly addresses SPA challenges in homebased exercise programs and aligns with findings that link SPA and SPE to physical activity participation (20).
In support of these findings, Lee et al. (21) have provided compelling evidence on benefits of using avatars in video conferencing.Their study demonstrated that avatars, rather than actual appearances, could enhance participant confidence and engagement.Their findings reveal that using avatars can foster better interactions, alleviate anxiety about appearances and behaviors, and promote increased concentration and willingness to engage in virtual communication.Similarly, observable changes in physical activity (PA) have been noted in virtual environments based on the selection of avatars.For example, it has been found that overweight children randomly allocated to avatars depicting a typical body size demonstrate enhanced performance in a PAcentric video game (22).This cohort also displayed heightened exercise motivation and a more favorable disposition toward PA compared to their counterparts assigned to obese avatars.Likewise, women assigned to slender avatars during engagement in a tennis exergame exhibited heightened PA compared to those assigned to obese avatars (23).These findings are reproducible within men cohorts, wherein an association exists between increased PA and allocation to slim rather than overweight avatars (24).This aligns with the notion that modifying virtual self-representation, such as through avatars, could be an innovative approach to manage SPA and improve exercise adherence.From this perspective, this paper presents a method to explore potential solutions for promoting exercise participation and addressing privacy concerns associated with home-based exercise training.
This study primarily focused on privacy concerns associated with video conference technology used in home-based exercise training programs.Additionally, we aimed to improve privacy protection while exploring approaches to reduce the burden caused by exposing the exercise environment and process, with the goal of enhancing exercise motivation.To achieve this, we first identified vulnerabilities of current video conference technologies used and proposed alternatives to address privacy concerns.Furthermore, we identified factors contributing to perceived burden in an exercise environment and process to suggest mitigation strategies.By doing so, we seek to find safe and effective methods to utilize video conference technologies in home-based exercise training while simultaneously ensuring privacy protection and promoting user engagement during exercise.
This study used a novel approach that could leverage VR technology (3D human-model rendering based on 3D key-points tracking) and virtual try-on network (UV mapping-enhanced instance-segmentation-based virtual try-on network: UVI-VTON) to mitigate constraints associated with user privacy protection effectively while alleviating the burden experienced by users in the context of screen sharing for educational purposes.The remainder of this manuscript is organized as follows.Section 2 presents a review of related research from perspectives of VRbased conference and VR-based exercise.Section 3 introduces applications of VR and UVI-VTON in home-based exercise training.Section 4 presents an empirical study evaluating the usability of the proposed approach.Sections 5 describes statistical analysis results and discussion of the usability survey.Section 6 then summarizes key findings of this study and suggests some future research directions.

Related work
This study was inspired by existing research incorporating VR technologies, which could be broadly categorized into two groups: VR and VTON.

. Virtual reality
Currently, VR technology is gaining attention as a means of promoting and assisting physical activity.The use of VR in exercise has potential to generate more favorable effects on physiological, psychological, and rehabilitative outcomes of individuals than conventional exercise methodologies (25).Prior to the COVID-19 pandemic, these studies were already underway, showcasing the efficacy of virtual reality interventions.For instance, a comparative study between virtual reality game exercises and ball exercises has highlighted superior benefits of VR games in enhancing balance abilities of older adults (26).Arlati et al. have introduced VR-based SocialBike, tailored to improve clinical outcomes in older adults (27).Furthermore, a study focusing on individuals with cardiovascular diseases has demonstrated significant enhancements in body composition, lipid profile, and eating patterns through cardiac rehabilitation within a virtual reality environment in comparison with conventional rehabilitation methods (28).
Since the onset of the COVID-19 pandemic, there has been a resurgence of interest in this research domain.Previous research has indicated that virtual reality exercise and fitness apps can be used as effective strategies for enhancing physical and mental wellbeing during the pandemic period (29).Additionally, during COVID-19 lockdowns, empirical studies have demonstrated that the use of smart applications, including live streaming exercise classes and virtual reality fitness programs, has a positive impact by promoting physical activity (30).The necessity of replacing outdoor exercise with a viable alternative has been increasingly emphasized since the COVID-19 outbreak.VR has stood out as a pivotal alternative to outdoor exercise, demonstrating a substantial capacity to improve cognitive function and mitigate motor disabilities (31).In particular, there have been active discussions on the necessity and effectiveness of alternative exercises in academic circles for individuals facing difficulties with outdoor activities, such as older adults (32), patients with post-COVID-19 condition (33), patients recovering from COVID-related pneumonia (34), physically inactive individuals during the COVID-19 lockdown (30), and patients diagnosed with Parkinson's disease (35).In addition, both specialized and gaming VR can be effective in treating upper-extremity impairments, with specialized VR showing potential for improving balance in patients with neurological conditions (36).In contrast to studies focusing on individuals with specific physical limitations, the study by Ng et al. (37) investigated the impact of VR-or augmented reality (AR)-enhanced training on physical activity in healthy individuals through a meta-analysis to.It demonstrated that VR intervention had substantial effects on physical activity levels, minor effects on performance, but no effects on psychological outcomes.

. Virtual try-on network
VTONs leverage advanced computer-vision and machinelearning techniques to simulate the process of virtually trying on clothing items.By overlaying virtual garments on a user's body, individuals can visualize how different clothing items would look on them without actually wearing them.Recent advancements in deep-learning-based VTONs have shown significant improvements in clothing synthesis performances.For instance, context-driven virtual try-on network (C-VTON) (38) employs discriminators specific to different types of contextual information, allowing enhanced clothing synthesis quality.The dual-branch collaborative transformer (DBCT) (39) utilizes a transformer-based architecture and cross-modal information to improve the virtual try-on process.To address the alignment of target garments to corresponding body parts, a novel global appearance flow estimation model has been proposed, which warps clothing spatially (40).The Dress Code dataset (41) containing images of diverse categories of clothes has also been introduced.This dataset enables the generation of highresolution try-on images.These virtual try-on technologies have been utilized for virtual fitting in online shopping.Several studies have investigated their impacts.Research has shown that attitudes of consumers toward virtual try-on can influence their online purchasing behaviors (42).Furthermore, a review of literature has shown a wide range of psychological (cognitive, emotional, and social) and behavioral consequences associated with virtual try-on in the context of shopping (43).

Methods
The aim of this study was to develop a methodology utilizing deep-learning-based models to safeguard the privacy of individuals engaging in home-based exercises and to encourage exercise participation.Specifically, the methodology consisted of two key features: (1) VR (3D human-model rendering based on 3D keypoints tracking); and (2) virtual try-on network (UVI-VTON).The 3D human model rendering was designed for real-time motion tracking and avatar rendering based on images of individuals engaged in movement.It was generated by predicting joint positions of users using deep-learning algorithms and mapping them to a prebuilt human model.The UVI-VTON was developed to enhance video generation speed compared to the conventional virtual try-on network, enabling real-time video generation while improving video quality.It could generate an image of the user wearing different clothing, allowing them to exercise without being self-conscious about their appearance.
To evaluate the effectiveness and usability of this methodology, we conducted experiments and a usability survey.Experiments involved testing performances of deep-learning models used in real-world scenarios.The usability survey involved evaluating user experiences of the methodology, including user satisfaction and feedback assessments on coaching effectiveness from the perspectives of privacy protection and motivation.In this section, we will describe each of these components in detail, including methods used, results obtained, and implications for future development.

VR: D human-model rendering
The skinned multi person linear (SMPL) model widely used in computer vision was employed as the 3D human model to construct human shapes.The SMPL model comprised a 3D body mesh that could transform shapes of the joints and muscles of a person to resemble their actual appearance depending on body shape and key points.The SMPL model could be rendered from estimated parameters based on real-time image frames.Estimated SMPL parameters could then be used with deep-learning-based 3D key-points estimation models.This instance 3D key points estimation model has been discussed in various ways in the field of computer vision.As shown in Figures 1A, B, HRNet was adopted (44) to detect and track 3D key points of a person from a single image frame.
The SMPL model represents the shape of a person without clothing.Using the generated model as an avatar can potentially make users feel uncomfortable and unpleasant.In this research, to create a more comfortable and confident representation of the user, the avatar representing the user tried on clothing.Furthermore, various clothing designs were implemented with the ability for color selection to reflect individuality of the user to achieve effects discussed in a previous study (45).This demonstrate that increased avatar personalization can lead to greater body ownership, presence, and emotional responses.As shown in Figures 1C, D, the SNUG model ( 46) was adopted to try on clothes over vertices of the SMPL mesh.This model utilizes physical loss to depict dynamic aspects of clothing by taking into account movement speed of the human body, resulting in a more realistic representation of garments.Three types of top and two types of bottom apparel were applied to the SMPL model (47) using this approach.
. UVI-VTON: user image regenerated with reference garments using generative adversarial network For training the UVI-VTON, we captured movements of subjects performing yoga and weight training.These two exercise types were used in our home-based training application.One experimental group composed of 14 males and 7 females who performed weight training.Another experimental group consisted of 6 males and 14 females engaged in yoga, resulting in the collection of exercise videos from a total of 41 participants.Each participant executed 10 prescribed movements specified in Table 1 for a single exercise category, i.e., either yoga or weight training.For each exercise movement, participants alternated between wearing four sets of upper-and lower-body clothing and performed exercise movements iteratively.This process resulted in a total of 40 sets of exercises (1 exercise category × 10 movements × 4 clothing sets) performed by each participant.As shown in Figure 2, these scenes were captured using a setup of eight synchronized cameras (GoPro Hero5) positioned to face the subject, with each video recorded at 30 frames per second.The resulting dataset contained a total of 20,606,496 image frames (8 cameras × 2,575,812 image frames extracted from the exercise videos of 41 people) that captured activities of the 41 participants across the 8 cameras.This dataset was used to train the model.UVI-VTON consisted of three main components: a segmentation predictor, a try-on network for the top (TON-T), and a try-on network for the bottom (TON-B).The segmentation predictor generated a segmentation image from the raw image, where areas for wearing top and bottom garments were indicated explicitly.This segmentation image was utilized by the two try-on networks to try on respective clothing items.
As shown in Figure 3A, the UV field was extracted from the raw image (I raw ) using Densepose (48), enabling pixellevel recognition of human body parts in the image and estimation of their 3D positions and orientations, which were then visualized as density maps.Subsequently, the segmentation predictor applied segmentation maps to this UV field, generating a segmentation image I seg ) that delineated a total of 12 distinct body parts: top, bottom, head, neck, left arm, right arm, left hand, right hand, left leg, right leg, left foot, and right foot.
As shown in Figure 3B, the top garment (G top ) was processed through the garment warping network (GWN), resulting in a warped top-garment image (I top garment ) that was fed to TON-T.Additionally, the raw image (I raw ) was segmented to produce the semantic segmentation image (I seg ), generating parts for preservation (I ).This process simulated a person trying on the top and bottom garments, resulting in I top, bottom pred wherein the person appeared to be wearing the replaced top and bottom apparel.
To try on G bottom , semantic segmentation was utilized and divided into three masks: a mask for G bottom (M bottom garment ), a mask for lower-body skin (M bottom skin ) consisting of both legs, and a    preservation mask (M bottom preserv ) consisting of the top, head, both arms, both hands, and both feet.The GWN was then used to warp G bottom to match the shape of M bottom garment , resulting in I top, bottom garment .This I top pred was then divided into preserved parts (I bottom preserv ) determined by M bottom preserv and regenerated skin parts (I bottom skin ) of the lower body defined by M bottom skin .Consequently, I bottom preserv , I bottom skin , and I bottom garment were combined as inputs to TON-B to predict I bottom pred .To train the proposed network, an objective loss function was constructed by combining four loss components: L2 loss, VGG19-based perceptual loss, second-order smooth constraint loss (49), and adversarial loss computed using the pix2pixHD discriminator (49).

Experiments
To assess the effectiveness of UVI-VTON and VR, participants were recruited to perform and evaluate home-based exercise training programs using the developed application.Test exercises consisted of weight training and yoga guided by a coach such that participants had the opportunity to try out proposed methodologies instructed by the coach.After completing the exercise session, participants responded to a usability survey.

. Participants
Experiments to test UVI-VTON and VR comprised a total of 35 participants, including 5 males and 30 females.These participants aged from 18 to 48 years, with a mean age of 30 (±8) years.Participants were chosen based on selfselection through recruitment advertisements.All participants were enrolled after this study obtained Institutional Review Board (IRB) approval (Sungkyunkwan University IRB approval number: SKKU 2021-12-014).

. Procedures
Participants were assigned to five different teams, each having distinct time schedules.Participants joined teams based on their preferred time slots for the experiment.Simultaneously, efforts were made to minimize disparities in team sizes, aiming to achieve a balanced distribution of members among teams.Each team had six to eight members.Experiments were conducted once a week for 8 weeks.During each session, participants engaged in a total of eight exercise sessions, alternating between four sessions each of weight training and yoga, each lasting 30 min.Two coaches, one specializing in yoga and the other in weight training, participated Frontiers in Public Health frontiersin.org in the experiment.To enhance experiences of participants in homebased exercise training, types and order of exercises were varied over a 4-week period as depicted in Table 1.
The coach, situated in a prepared studio, shared their own video feed with participants through a camera, while participants shared their video feeds using their mobile devices from the comfort of their homes.The coach provided instructions for prescribed exercises in a sequential manner.Upon demonstrating each exercise, participants repeated movements accordingly.Simultaneously, participants adjusted their exercise postures in accordance with instructions from the coach, allowing for personalized training modifications.To assess the impact of utilizing UVI-VTON and VR in home-based exercise training compared to not using any methodology, participants engaged in training sessions employing UVI-VTON and VR as well as sessions where no specific methodology was employed.As shown in Figures 4, 5, when utilizing methodologies, participants activated either UVI-VTON or VR upon request from the coach, thereby sharing their real-time modified visual representations on the screen.

. Usability survey
Participants were asked to respond to a survey based on their own experiences with home-based exercise training after an 8-week period of participation.As depicted in Table 2, the usability survey was categorized under three evaluation factors: privacy protection scale (PPS), self-confidence and motivation scale (SMS), and coaching satisfaction scale (CSS).The PPS questionnaire comprehensively assessed participants' perceived burden of camera recording and the extent to which their privacy was protected.This survey comprised three items rated on a scale of 1 (strongly disagree) to 5 (strongly agree) by participants.The SMS questionnaire measures self-confidence and motivation effects derived from home-based exercise training.This questionnaire consisted of two items rated on a scale of 1 (strongly disagree) to 5 (strongly agree) by participants.The CSS questionnaire measures participants' perceptions of whether their exercise performances were accurately conveyed to the instructor and whether they received appropriate feedback based on their experiences in homebased exercise training.This questionnaire comprised two items rated on a scale of 1 (strongly disagree) to 5 (strongly agree) by participants.The raw data of usability survey is depicted in Table 3. Usability survey data were analyzed using one-way analysis of variance (ANOVA).Significant interactions and main effects were investigated through Bonferroni-corrected pairwise comparisons.A significance level of p < 0.05 was considered to be statistically significant.Precise values of p are reported unless p < 0.001.Descriptive statistics for all data are displayed as mean ± standard deviation (SD) with 95% confidence intervals (CIs).

Results
Results of the ANOVA conducted for PPS, SMS, and CSS of home-based exercise training with VR or UVI-VTON indicated no significant effects, suggesting that both methodologies elicited similar emotional responses.However, when home-based exercise training with VR or UVI-VTON were compared to home-based exercise training without VR or UVI-VTON, significant differences in PPS, SMS, and CSS were observed.
When conducting home-based exercise training, significant differences were observed for question 1 of PPS among the use of UVI-VTON, VR, and no intervention [F (2, 68) = 15.880;p < 0.001].Post-hoc analysis revealed no significant difference (p = 0.095) between the use of UVI-VTON and VR for question 1.However, the use of UVI-VTON showed a significantly higher rating of 0.514 (p = 0.003) than the use of nothing for question 1.The use of VR also exhibited a significantly higher rating of 0.686 (p < 0.001) than the use of nothing for question 1.
When conducting home-based exercise training, significant differences were observed in question 2 of PPS among the use of UVI-VTON, VR, and nothing [F (2, 68) = 20.035;p < 0.001].Posthoc analysis revealed no significant difference (p = 0.069) between the use of UVI-VTON and VR for question 2. However, the use of UVI-VTON showed a significantly higher rating of 0.600 (p < 0.001) than the use of noting for question 2. The use of VR also exhibited a significantly higher rating of 0.743 (p < 0.001) than the use of nothing for question 2.
When conducting home-based exercise training, significant differences were observed in question 3 of PPS among the use of UVI-VTON, VR, and nothing [F (2, 68) = 16.302;p < 0.001].Posthoc analysis revealed no significant difference (p = 1.000) between the use of UVI-VTON and VR for question 3.However, the use of UVI-VTON showed a significantly higher rating of 0.971 (p < 0.001) than the use of nothing for question 3. The use of VR also exhibited a significantly higher rating of 1.000 (p < 0.001) than the use of nothing for question 3.
When conducting home-based exercise training, significant differences were observed in question 4 of SMS among the use of UVI-VTON, VR, and nothing [F (2, 68) = 7.299; p = 0.006].Posthoc analysis revealed no significant difference (p = 1.000) between the use of UVI-VTON and VR for question 4.However, the use of UVI-VTON showed a significantly higher rating of 0.686 (p = 0.0133) than the use of nothing for question 4. The use of VR also exhibited a significantly higher rating of 0.629 (p = 0.042) than the use of nothing for question 4.
When conducting home-based exercise training, significant differences were observed in question 5 of SMS among the use of UVI-VTON, VR, and nothing [F (2, 68) = 8.279; p = 0.003).Posthoc analysis revealed no significant difference (p = 0.506) between the use of UVI-VTON and VR for question 5.However, the use of UVI-VTON showed a significantly higher rating of 0.743 (p = 0.004) than the use of nothing for question 5.The use of VR also exhibited a significantly higher rating of 0.600 (p < 0.039) than the use of nothing for question 5.When conducting home-based exercise training, there were no significant differences in question 6 of CSS among groups using UVI-VTON, VR, and no intervention  participants can help us develop and provide more effective exercise programs.
In this study, three measurement criteria were used to evaluate emotional responses of participants, namely PPS, SMS, and CSS.PPS was used to assess perceptions regarding privacy protection.SMS was used to measure confidence and motivation levels of users and CSS was used to evaluate user satisfaction with exercise posture and feedback.
As seen in Figure 6, significant differences in PPS were observed when comparing the use of UVI-VTON vs. non-use, indicating that UVI-VTON had a positive impact on privacy protection.Similarly, significant differences in PPS were observed between the use of VR vs. non-use, suggesting that VR could also have a positive impact on privacy protection.These research findings demonstrate that the implementation of UVI-VTON or VR can help alleviate concerns regarding user privacy while enhancing trust in exercise programs.Moreover, results indicate the potential of UVI-VTON or VR to improve user exercise experiences and contribution to personal data protection, thus creating a safer and more comfortable exercise environment.Analysis results also revealed no significant differences in PPS between the use of UVI-VTON and VR.This suggests that both methodologies could elicit similar emotional responses in relation to measured criteria for PPS.While UVI-VTON covers all parts except the face, VR obscures all information captured by the camera.One might expect that the PPS would be significantly higher in VR than UVI-VTON owing to its ability to conceal more than just the face.However, when comparing the use of UVI-VTON vs. the use of VR, the absence of a significant difference in questions 4 and 5, as depicted in Figure 7, implies that concerns regarding personal data exposure primarily stem from the exposure of one's body and background while wearing clothing, excluding the face.
From Figure 7, when comparing the use of UVI-VTON vs. non-use, significant differences were observed for SMS comprising questions 4 and 5.This indicates that UVI-VTON has a positive impact on confidence enhancement and motivation.Similarly, significant differences were observed for SMS when comparing the use of VR vs. non-use.This suggests that VR also has a positive impact on confidence enhancement and motivation.These research findings highlight that engaging in exercise while utilizing UVI-VTON or VR technology provides a means to conceal personal appearance, enabling users to perform their exercises with enhanced self-assurance.This aligns with the goal of improving the exercise environment in the proposed home-based exercise training program, which aims to encourage and motivate users to participate in physical activities.Furthermore, analysis results revealed no significant differences in SMS between the use of UVI-VTON and VR.This suggests that both methodologies can elicit similar emotional responses when measured against the criteria for SMS.Similar to findings from the analysis of PPS responses, these results indicate that methodologies capable of obscuring users' bodies except their face and background   Coaching Satisfaction Scale (CSS) 6. Are you satisfied with the quality of the instructions from the coach for your exercise posture and feedback?
have potential to enhance self-confidence and promote exercise participation.Additionally, Figure 7 shows that there are no significant differences among the three cases for CSS comprising question 6.This indicates that there is no qualitative difference in guidance by an exercise coach when compared with delivery of postprocessed exercise videos in home-based exercise training using UVI-VTON and VR against the delivery of raw exercise videos without using these methodologies.

Limitations
This study has a limitation.Since this study was conducted with a demand group for home training, which was dominated by women, it resulted in an unbalanced sample, which might limit the generalizability of results.Therefore, future studies over an extended period with a larger and more diverse participant base while considering factors such as gender, age, and physical attributes need to be conducted to enhance the reliability and potential applicability of this study's results.
top preserv ) and skin parts for regeneration (I top skin ) in the process of trying on the top.Both these images were used as inputs to TON-T.Applied inputs were then synthesized through TON-T, resulting in the predicted top-replaced image (I top pred ).This process could simulate a person trying on the top garment to produce I top pred wherein the person appeared to be wearing the replaced top.To try on G top , semantic segmentation was utilized and divided into three masks: a mask for G top (M top garment ), a mask for upper-body skin (M top skin ) consisting of both arms and neck, and a preservation mask (M top preserv ) consisting of the bottom, head, both hands, both legs, and both feet.The GWN was used to warp G top to match the shape of M top garment , resulting in I top garment .I raw was divided into preserved parts (I top preserv ) determined by M top preserv and regenerated skin parts (I top skin ) of the upper body defined by M top skin .Consequently, I top preserv , I top skin , and I top garment were combined as inputs to TON-T to predict I top pred .TON-B had the same pipeline as TON-T, with the only difference being the reference garments (top or bottom) that users tried on.Similarly, the bottom garment (G bottom ) was processed through the GWN, resulting in a warped bottom-garment image (I bottom garment ) that was then fed to TON-B.Additionally, I top pred was segmented by I seg to generate parts for preservation (I bottom preserv ) and skin parts for regeneration (I bottom skin ) in the process of trying on the bottom.Both these images were then used as inputs to TON-B.These inputs were then synthesized through TON-B, resulting in the predicted top-and-bottom-replaced image (I top, bottom pred

FIGURE
FIGURE Procedure for D human modeling with cloth.Given the raw frame (A), the detected D key points (B) are mapped to a D human model (C) constructed using SMPL model ( ).Subsequently, the SNUG model ( ) is used to apply clothing to the corresponding D human model to render a D human model with cloth (D).
FIGUREExperimental setup with eight installed cameras that are situated at distinct angles and oriented toward the subject.

FIGURE( A )
FIGURE (A) Segmentation predictor: It aims to generate a semantic segmentation image (I seg ) in real time from a raw image (I raw ) by applying the segmentation map to the UV field predicted by Densepose ( ). (B) Overview of the proposed network: The semantic segmentation image (I seg ) is used to try G top on I raw , where the garment warping network (GWN) warps G top to I top garment and inputs it to TON-T.Then, I raw is divided into the parts for preservation (I top preserv ) and skin parts for regeneration (I top skin ), which are used as inputs to TON-T to generate I top pred .Similarly, I seg is employed to try G bottom on I top pred , where the GWN warps G bottom to I bottom garment and inputs it to TON-T.Subsequently, I top pred is divided into the parts for preservation (I bottom preserv ) and skin parts for regeneration (I bottom skin ), which are used as inputs to TON-B to generate I top, bottom pred .
[F (2, 68) = 2.121; p = 0.136].The present study aimed to investigate emotional responses of participants regarding home-based exercise training with or without the use of UVI-VTON or VR.This study is a pioneering effort to develop and apply VTON and VR in home-based exercise training.Understanding impacts of these new methodologies on

FIGURE
FIGURE Home-based exercise training with VR ( D human-model rendering based on D key-points tracking).

Question
Privacy Protection Scale (PPS) 1.Is the user privacy protected effectively during home-based exercise training?2. Is the user privacy adequately ensured during home-based exercise training?3. Is there no burden associated with camera recording for home training purposes?Self-confidence and Motivation Scale (SMS) 4. Did you feel confident during the exercise?5. Did you feel motivated to exercise?

FIGURE
FIGURE Privacy Protection Scale (PPS) consisting of three questions (Question , Question , Question ) for all participants.PPS serves as a tool to assess participants' privacy protection levels based on the presence or absence of UVI-VTON and VR.Vertical error bars represent standard deviation for response for each question.*p < ., ***p < . .

FIGURE
FIGURESelf-confidence and Motivation Scale (SMS) comprising questions and is employed to assess participants' self-confidence and motivation levels with consideration of the presence or absence of UVI-VTON and VR.Additionally, the Coaching Satisfaction Scale (CSS), comprising a single question (Question ), is used to gauge participants' coaching satisfaction level, taking into account the presence or absence of UVI-VTON and VR.Vertical error bar represents standard deviation for response for each question.*p < .; **p < . .

TABLE Order
of exercises in home-based exercise training.

TABLE Survey
questions administered upon completion of home-based exercise training.
Findings of this study show that applying new methodologies to home-based exercise training can improve emotional responses of participants.Specifically, UVI-VTON and VR have positive impacts on privacy protection, confidence enhancement, and exercise motivation.These findings can help promote exercise training participation in the home environment.Advancements made in the home-based exercise training environment through this research are expected to facilitate greater accessibility for a wider population.Furthermore, this study serves as a crucial indicator for developing methodologies aimed for at enhancing home-based exercise training environment.ConclusionsThis research aims to help individuals who rely on homebased exercise, encompassing patients, older adults, and TABLE Raw data of usability survey (mean ± SD).Use of nothing 4.14 ± 0.97 4.11 ± 0.96 3.63 ± 1.40 3.86 ± 1.22 3.89 ± 1.32 4.71 ± 0.53